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American college football faces a conflict created by the desire to stage national 
championship games between the best teams of a season when there is no conven- 
tional playoff system to decide which those teams are. Instead, ranking of teams 
is based on their record of wins and losses during the season, but each team plays 
only a small fraction of eligible opponents, making the system underdetermined or 
contradictory or both. It is an interesting challenge to create a ranking system that 
at once is mathematically well-founded, gives results in general accord with received 
wisdom concerning the relative strengths of the teams, and is based upon intuitive 
principles, allowing it to be accepted readily by fans and experts alike. Here we 
introduce a one-parameter ranking method that satisfies all of these requirements 
and is based on a network representation of college football schedules. 

I. INTRODUCTION 

Inter-university competition in (American) football is big business in the United States. 
Games are televised on national TV; audiences number in the millions and advertising 
revenues in the hundreds of millions (of US dollars). Strangely, however, there is no official 
national championship in college football, despite loudly-voiced public demand for such a 
thing. In other sports, such as soccer or basketball, there are knockout competitions in 
which schedules of games are drawn up in such a way that at the end of the competition 
there is an undisputed "best" team — the only team in the league that remains unbeaten. A 
simple pairwise elimination tournament is the most common scheme. 

The difficulty with college football is that games are mostly played in conferences, which 
are groups of a dozen or so colleges chosen on roughly geographic grounds. In a typical 
season about 75% of games are played between teams belonging to the same conference. As 
a result there is normally an undisputed champion for each individual conference, but not 
enough games are played between conferences to allow an overall champion to be chosen 
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unambiguously. Some other sports also use the conference system, and in those sports an 
overall champion is usually chosen via a separate knockout tournament organized among 
the winners and runners up in the individual conferences. In college football, however, for 
historical and other reasons, there is no such post-season tournament. 

To fulfill the wishes of the fans for a national championship, therefore, several of the 
major conferences have adopted a system called the Bowl Championship Series (BCS, 
www.bcsf ootball . org), in which one of four existing post-season "bowl games" — the Rose, 
Sugar, Fiesta, and Orange Bowls — is designated the national championship game on a ro- 
tating basis and is supposed to match the top two teams of the regular season Q]. (For the 
2004 season it was the Orange Bowl; in the upcoming 2005 season it will be the Rose Bowl.) 
The problem is how to decide which the top teams are. One can immediately imagine many 
difficulties. Simply choosing unbeaten teams will not necessarily work: what if there are 
more than two, or only one, or none? How should one account for teams that play different 
numbers of regular-season games, and for "strength of schedule" — the fact that some teams 
by chance inevitably play against tougher opponents than others? What about margins of 
victory? Should a decisive victory against your opponent count for more than a narrow 
victory? Should home games be counted differently from away games? 

The problem of ranking competitors based on an incomplete set of pairwise comparisons 
is a well-studied one, both in football and other sports, and more generally Many 
different methods and algorithms have been proposed, frequently taking into account the 
issues mentioned above jll 0, Q, 0, [ll|, 3- Currently football teams are ranked using a 
weighted composite score called the BCS ranking that combines a number of these methods 
with polls of knowledge human judges. The formula used changes slightly from year to year; 
the most recent version averages six computer algorithms and two human polls. (One 
of them, the Associated Press (AP) poll, has opted out of the system starting from the 2005 
season.) There is, however, considerable unhappiness about the system and widespread 
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disagreement about how it should be improved 17]. There is, thus, plenty of room for 
innovation. 

In this paper we present a new method of ranking based on a mathematical formulation 
that corresponds closely to the types of arguments typically made by sports fans in comparing 
teams. Our method turns out to be equivalent to a well-known type of centrality measure 
defined on a directed network representing the pattern of wins and losses in regular-season 



FIG. 1: If team A has beaten team B, and team B has beaten team C, team A scores an indirect 
win over team C (indicated by the bold arrow). 

games. 

II. DEFINITION OF THE METHOD 

Perhaps the simplest measure of team standing is the win-loss differential, i.e., the number 
of games a team wins during the season minus the number it loses. (In American football 
there are no tied games — games are extended until there is a winner.) Indeed, the win-loss 
differential is almost the only measure that everyone seems to agree upon. It is unfortunate 
therefore that in practice it correlates rather poorly with expert opinions about which teams 
are best, for many of the reasons cited in the previous section, such as variation in strength 
of schedule. As we show here, however, we can correct for these problems, at least in part, 
by considering not just direct wins and losses, but also indirect ones. 

One often hears from sports fans arguments of the form: "Although my team A didn't 
play your team C this season, it did beat B who in turn beat C. Therefore A is better than 
C and would have won had they played a game." (See Fig. [TJ) I n i&ct, the argument is 
usually articulated with less clarity than this and more beer, but nonetheless we feel that 
the general line of reasoning has merit. What the fan is saying is that, in addition to a real, 
physical win (loss) against an opponent, an indirect win (loss) of the type described should 
also be considered indicative of a team's strength (weakness). It is on precisely this kind of 
reasoning that we base our method of ranking. 
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A. The college football schedule as a network 



The schedule of games for a season can be represented as a network or graph in which 
the vertices represent colleges and there is an edge between two colleges if they played a 
regular-season game during the season of interest Furthermore, we can represent the 
winner and loser of each game by making the network directed. We place an arrow on each 
edge pointing from the winner of the corresponding game to the loser. An example of such 
a network, for the 2004 season, is shown in FigEJ (The direction of the arrows is a matter of 
convention; we could have made the opposite choice had we wished and the network would 
still contain the same information.) 

Direct losses and wins of a team in this network correspond to edges running directly to 
and from that team, and indirect losses and wins, as defined above, correspond to directed 
paths of length 2 in the network, to and from the team. 

A particularly nice property of these indirect wins is that a direct win against a strong 
opponent — a team that has itself won many games — is highly rewarding, giving you auto- 
matically a large number of indirect wins. Thus, when measured in terms of indirect wins, 
the ranking of a team automatically allows for the strength of schedule. 

And there is no need to stop here: one can consider higher-order indirect wins (or losses) 
of the form A beats B beats C beats D, and so forth. These correspond to directed paths in 
the network of length three or more. Our proposed ranking scheme counts indirect wins and 
losses at all distances in the network, but those at greater distances count for less, because 
we feel it natural that a direct win against a team should count for more than the mere 
supposed victory of an indirect win. 

Mathematically, we can express these ideas in terms of the adjacency matrix A of the 
network, annxn real asymmetric matrix, where n is the number of teams (117 for Division I- 
A in the 2004 season), with element Aij equal to the number of times team j has beaten 
team i (usually or 1, but occasionally 2). The number of direct wins for a team can be 
written 




(1) 



j 



and the number of indirect wins at distance 2 (A beats B beats C) as 



indirect wins at distance 2 for team i = AkjA 

jk 



(2) 



FIG. 2: A graphical representation of the regular season schedule of division I-A teams in 2004. 
Teams are divided up by conference (A = Atlantic Coast, E = Big East, X = Big Ten, B = Big XII, 
C = Conference USA, M = Mid-American, P = Pac Ten, W = Mountain West, S = Southeastern, 
U = Sun Belt, T = Western Athletic, I = Independent). Directed edges point from winners to 
losers. 

and so forth. We discount indirect wins over direct ones by a constant factor a for every level 
of indirection, so that an indirect win two steps removed is discounted by a, an indirect win 
three steps removed by a 2 , and so forth. The parameter a will be the single free parameter 
in our ranking scheme. 

We now define the total win score Wi of a team i as the sum of direct and indirect wins 
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at all distances thus: 



Wi = Aji + a AjyA^ + a 2 ^ A hk A kj Aji + 

j kj hkj 



= J2i 1 + a J2 Ak j +a2 Yl A hkA kj + • • • ) 

= £(1 + o^) = fc° ut + a E 4^' ( 3 ) 
j j 

where &° ut is the out-degree of vertex z in the network — the number of edges leading away 

from the vertex. When written in this fashion, we see that the win score can also be viewed 

another way, as a linear combination of the number of games a team has won (the out-degree) 

and the win scores of the other teams that it beat in those games. 

Similarly the loss score U of a team is 

h = 22 A i:j + a 22 A ij A jk + « 2 X] A ij A jkAkh H 

j jk jkh 

= j2 A ij{ l + a J2 A j k+a2 J2 A 3 kAkh + '") 

j k kh 

= ^A u (l + a g = C + a^A^, (4) 

j j 

Now we define the total score for a team to be the difference Sj = w—li of the win and loss 
scores. Teams are then ranked on the basis of their total score. With this ranking scheme, 
a win against a strong opponent — one with a high win score — rewards a team heavily, while 
a loss against a weak opponent — one with high loss score — has the exact opposite effect. 
Thus, as discussed above, our ranking scheme automatically incorporates the strength of 
schedule into the scoring. 

Equations (j2J) and can conveniently be rewritten in vector notation, with w = 
w 2 , . . .), 1 = (h,l 2 , . . .), k out = (A; out !, A; out 2 , • • •) and k in = (A; in 1; A; in 2 , . . .), giving 

w = k out + «A T ■ w, 1 = k in + aA ■ 1, (5) 

or, rearranging, 

w=(/-«Ay 1 -k out , 1= (/ - aA) -1 • k in . (6) 



These formulas are closely relatec 
measure due to Katz and others 



to t 



rose for a well-known matrix-based network centrality 



8|,^3|, and our method can be regarded as a generalization 
of the Katz centrality applied to the network representation of the schedule of games. 
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Year 


•^max 


\-i 

max 


1998 


3.39401 


0.294637 


1999 


4.15120 


0.240894 


2000 


3.89529 


0.256720 


2001 


3.68025 


0.271721 


2002 


4.00933 


0.249418 


2003 


3.97901 


0.251319 


2004 


3.69253 


0.270817 



TABLE I: Eigenvalues A max and their inverses A max from regular seasons for the years 1998—2004. 

B. The parameter a 

Before applying our method we need to choose a value for the parameter a that appears 
in Eqs. (JHJ) and (|4"|) . A larger value of a places more weight on indirect wins relative to direct 
ones while a smaller one places more weight on direct wins. (For the special case a = only 
direct wins count at all and the total score for a team is simply the win-loss differential.) 

There are, in general, limits to the values a can take. It is straightforward to show that 
the series in Eqs. © and (@| converge only if a < A m ^ x , where A max is the largest eigenvalue 
of the adjacency matrix A. If the network is acyclic — has no loops of the form A beats B 
beats C beats A or longer — then the largest eigenvalue is zero (as indeed are all eigenvalues) 
and hence there is no limit on the value of a. This however is an unlikely situation: there has 
never yet been a season for which there were no loops in the network. Normally therefore 
there is a finite upper bound on a. Historically the values of this upper bound have been 
in the range 0.2 to 0.3 (Table HJ), so an indirect win cannot count for more than a fifth 
to a third of a direct win. However, the number of indirect wins is in general greater the 
farther out we go in the network, i.e., the higher the level of indirection. This means that 
the indirect wins can still make a substantial contribution to a team's score because of their 
sheer number. An a close to the upper bound gives roughly equal contributions to a team's 
score from indirect wins at all distances. 

Aside from the limit imposed by the requirement of convergence, a is essentially a free 
parameter, and different values will yield different final rankings of teams. As a simple 
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FIG. 3: The fraction of games won by (ultimately) higher-ranked teams in division I-A. 



criterion for judging which values are best, we calculate the rankings of all teams and then 
examine the retrodictive accuracy, the fraction of all games in a season that are won by the 
team with higher rank (as calculated from the final network after the season is complete) ^| . 
The results are shown as a function of a for each of the years for which the BCS has existed, 
1998-2004, in Fig. El We see that for a broad range of values of a our method ranks winners 
above losers about 80% of the time — a pretty good average — and the best results appear for 
values of a around 0.8 of the maximum allowed value. Thus a simple strategy would be just 
to choose a = 0.8 A~* x . 

While this strategy appears to give good results in practice, it has one problem, namely 
that the calculation of A max (and hence of a) requires a knowledge of the entire directed 
network, which means that we can only perform the calculation after the end of a season 
once the outcome of every game has been decided. In practice, however, one often wants 
to rank teams before the end of the season, for instance to calculate weekly standings from 
partial results as the season progresses. Thus we would like to be able to decide the value 
of a before the start of the season. In the next section we provide a method for doing this 
that appears to work well. 



9 



C. An algorithm for choosing a 



As discussed in the preceding section, the limit A~^ x on a would be infinite were there no 
loops in the network. Only if there are loops (which there usually are) does the limit become 
finite. And in general the more loops there are the lower the limit. (The combinatorial 
explosion in the combinations of loops that paths can circulate around makes the number of 
paths increase faster with path length when there are more loops, and this then requires a 
lower value of a to ensure convergence.) Real schedule networks have fewer loops than one 
would expect on the basis of chance, precisely because teams do vary in strength which gives 
rise to networks that are close to being acyclic. Thus we would expect the value of A max to 
be lower and the limit A~^ x to be higher in a real network than in a network with randomly 
assigned edges. (As a check, we have performed Monte Carlo randomizations of the edge 
directions in the real networks and find that indeed A max consistently increases when we do 
this.) This provides us with way to calculate a safe and practical upper bound on the value 
of a without knowledge of game outcomes: we simply calculate the limit for a network with 
randomly assigned outcomes. 

It is straightforward to calculate the largest eigenvalue for a random directed network 
in which the distribution of in- and out-degrees is known. Let P(k m = i, k ont = j) be the 
joint probability distribution of in- and out-degrees. The largest eigenvalue is equal to the 
factor by which the number of paths of a given length starting at a vertex increases when 
we increase that length by one, in the limit of long path length. But this is simply equal 
to the mean out-degree of the vertex reached by following a randomly chosen edge in the 
graph, which is given by jl^ 



For our random network, the joint degree distribution is derived by randomly assigning 
directions to edges on an initially undirected network whose degree distribution is given by 
the distribution of the number of games played by the teams in the regular season. Let us 
denote this distribution Then 



= 3 



/) (A: in ) 




(7) 
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Year 


A max from MC simulation 


((k 2 ) ~ (k))/2(k) 


1998 


4.957 ±0.068 


4.935 


1999 


4.901 ±0.066 


4.882 


2000 


4.927 ± 0.066 


4.896 


2001 


4.896 ± 0.065 


4.875 


2002 


5.350 ±0.069 


5.334 


2003 


5.277 ±0.064 


5.260 


2004 


4.859 ±0.065 


4.838 



TABLE II: Comparison of the values of A max calculated from Monte Carlo simulations and using 
Eq. © for the years 1998-2004. 

and so our expression for the largest eigenvalue is 

. Eg = o ij2-^ (7>,„ Ego Eg* ij± - (Dp* 
max E5=o«- (iW) (^Ki ^ E^oE^ 

E^,2-* Pjfc EL<(*-0O _ Er= 2- fc ^-i)2 fc - 2 



Er=o2- fe P,Eto^C) 

2(k) ' 



Er= 2-v^2 



fc-1 



(9) 



where (k) and (A; 2 ) are the mean and mean-square number of games played by a team in 
the season of interest. 

As a test of this calculation we have calculated numerically the actual values of A max 
for simulated seasons with randomly assigned wins and losses. The results are shown in 
Table ITT1 As the table shows, agreement between the analytic calculation and the simulations 
is excellent. 

All the values of A max in Table HI are larger by about 20% than the actual A max for the 
corresponding season (Table HJ), precisely because actual wins and losses are not random, 
but reflect the real strengths and weaknesses of the teams. But this means that the random- 
graph value of A max imposes a limit on a that will in general be about 0.8 of the limit derived 
from the true final schedule network incorporating the real wins and losses. And this value 
is right in the middle of the region found in the preceding section to give the best rankings 
of the teams. Thus an elegant solution to the problem of choosing a emerges. We simply 
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choose a value equal to the limiting value set by Eq. Q: 

a =wrw>- (10) 

This guarantees convergence, requires no knowledge of the eventual outcome of games, and 
appears to give optimal or near-optimal rankings under these constraints. This then is the 
value that we will use in the calculations in this paper. 



D. Comparison with the BCS rankings 

We now compare the results of our method with the official BCS ranking results. It is 
worth pointing out that agreement with the official rankings is not necessarily a sign of 
success for our method. If our method were better in some sense than the official method 
(for example in comparison with the opinions of human expert judges), then necessarily the 
two methods would have to disagree on some results. Nonetheless, since the BCS ranking 
is, by common consent, officially the collective wisdom, it is clearly of interest to see how 
our method compares with it. 

First, in Table ITTTI we show the rankings calculated from our method and from the BCS 
computer algorithms for the top 25 BCS teams of 2004. The value of a for 2004 from 
Eq. (fTUjl is 0.207, or about 0.763 of the maximum. 

Even from a casual inspection it is clear that there is a reasonable match between our 
rankings and the official ones. For instance, the correlation coefficient between the two sets 
of rankings is 0.90. Given the simplicity of our method, it is pleasantly surprising that the 
rankings are in such good agreement with other far more complicated algorithms. 

Among these 25 teams, our method classified two — Pittsburgh and Florida — to be outside 
the top 25. Interestingly, these same teams were also ranked outside the top 25 by all the 
BCS computer algorithms except the Billingsley algorithm. Furthermore, none of the other 
computer polls does any better at predicting the final top 25 — each gets at least two wrong. 
Other points of interest are the ranks of the Universities of Auburn, Texas, and California. 
Auburn, although undefeated in the regular season, did not participate in a championship 
game because it was consistently ranked third in all polls, and our method concurs. Texas 
and California played very similar seasons but both human polls ranked California to be 
higher, while all the computer polls said the reverse. Our method lines up with the computer 
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BCS 


School 


Our method 


BCS Computers 


1 


Southern California 


2 


2 


2 


Oklahoma 


1 


1 


3 


Auburn 


3 


3 


4 


Texas 


4 


4 


5 


California 


8 


6 


6 


Utah 


5 


5 


7 


Georgia 


16 


8 


8 


Virginia Tech 


6 


T-9 


9 


Boise State 


7 


7 


10 


Louisville 


11 


13 


11 


Louisiana State 


15 


T-9 


12 


Iowa 


10 


12 


13 


Michigan 


14 


17 


14 


Miami (FL) 


9 


T-14 


15 


Tennessee 


17 


T-14 


16 


Florida State 


12 


21 


17 


Wisconsin 


20 


20 


18 


Virginia 


18 


18 


19 


Arizona St. 


13 


11 


20 


Texas A&M 


19 


16 


21 


Pittsburgh 


27 




22 


Texas Tech 


23 


22 


23 


Florida 


26 




24 


Oklahoma State 


21 


19 


25 


Ohio State 


22 





TABLE III: Comparison of standings for the final top 25 BCS teams in 2004, calculated using the 
method described in this paper and the standard BCS composite computer ranking, "-"denotes 
a team that was not ranked among the top 25 in the BCS composite computer ranking and "T" 
denotes a tied rank. 
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2003 


2002 


2001 


Our method 


BCS 


Our method 


BCS 


Our method 


BCS 


Oklahoma 


Oklahoma 


Ohio St. 


Miami (FL) 


Tennessee (6) 


Miami (FL) 


Southern Cal 


Louisiana St. 


Southern Cal 


Ohio St. 


Miami (FL) 


Nebraska 


Florid St. (7) 


Southern Cal 


Miami (FL) 


Georgia 


Illinois (8) 


Colorado 


Louisiana St 


Michigan (10) 


Georgia 


Southern Cal 


Colorado 


Oregon (6) 


Miami (FL) (9) 


Ohio St. (6) 


Oklahoma (7) 


Iowa (8) 


Nebraska 


Florida (7) 


2000 


1999 


1998 


Our method 


BCS 


Our method 


BCS 


Our method 


BCS 


Washington 


Oklahoma 


Florida St. 


Florida St. 


UCLA 


Tennessee 


Oklahoma 


Florida St. 


Mich. St. (9) 


VA Tech (6) 


Florida St. 


Florida St 


Oregon St. (6) 


Miami (FL) (8) 


Nebraska 


Nebraska 


Texas A&M (6) 


Kansas St. 


Florida St. 


Washington 


Michigan (8) 


Alabama 


Tennessee 


Ohio St. (7) 


Oregon (10) 


VA Tech (15) 


Alabama 


Tennessee (8) 


Kansas St. 


UCLA 



TABLE IV: Comparison of the top five teams calculated using the method presented in this paper 
and using the complete BCS composite ranking (including human polls) for the years 1998-2003. 
Numbers in parentheses for our method denote teams' ranks under BCS, and vice versa. 



polls in this respect. 

In Table IIVI we compare the top five BCS teams for each year (with a again selected as 
described in Section III CI and taking values typically between 0.7 and 0.85 of the maximum 
allowable value). The rankings consistently agree on at least three of the top five teams in 
each year. 

For a more quantitative comparison of our method with the official BCS rankings we have 
also examined the retrodictive accuracy, as denned earlier. Since the BCS announces only 
the final top 15 or 25 teams in its rankings each year, we have constrained our calculation 
of the retrodictive accuracy to the games played among those teams. The results are given 
in Table El 
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2004 


2003 


2002 


2001 


2000 


1999 


1998 


Our method 


0.81 


0.62 


0.89 


0.47 


0.65 


0.67 


0.83 




35/43 


23/37 


16/18 


7/15 


15/23 


8/12 


15/18 


BCS 


0.84 


0.69 


0.79 


0.47 


0.75 


0.69 


0.83 




29/37 


25/36 


15/19 


8/17 


12/16 


11/16 


10/12 



TABLE V: Retrodictive accuracies and the numbers of games played among the top 25 (2004-2003) 
or 15 (2002-1998) teams for our method and for the official BCS rankings. 

III. A NON-FOOTBALL EXAMPLE APPLICATION 

The discussions so far in this paper have focused on the problem of ranking teams in 
American college football, but our method could in principle be used for a wide variety of 
other ranking problems in which individuals, groups, or objects are compared in a pairwise 
fashion. Such problems come up in very many fields of scientific interest. Here we give one 
example from biology, of dominance hierarchies among animals. 

In a study published in 1979, Lott observed the pattern of dominant and submissive 
behaviors between 26 male American bison in Montana over a period of about a month. 
Pairs of bison engaged in aggressive interactions to establish dominance within the herd and 
the outcome of each observed interaction was noted, creating a directed network of "wins" 
and "losses" between pairs of animals, just as in the college football case. Lott also observed 
the breeding success of the bison over the same period, as measured by the number of mating 
events between the male bison involved in the dominance hierarchy and the cows (who do 
not engage in the aggressive interactions). 

We used our method to calculate a ranking for the bison, using a value of a = 0.124 from 
Eq. (JTUj). We find that the resulting rankings are highly correlated with breeding success of 
the bison, with a correlation coefficient of 0.611. Correlation between status and breeding 
success is a widely understood feature of breeding populations our method provides 

a quantitative confirmation in this particular case, and could in principle be applied to other 
examples for which similar data are available. 

One can also represent the correlation between rank and breeding success using a so- 
called Lorenz curve — a device often used to represent inequality in the distribution of wealth 
between the richest and poorest individuals. We show a Lorenz curve for our bison example 
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FIG. 4: A Lorenz curve (red) of breeding events versus the rankings of the bison. It is placed 
consistently above the equality line (green) which represents a perfectly even distribution, indicating 
a positive correlation between breeding power and dominance hierarchy. 

in Fig. HJ The curve is a plot of the fraction of mating events that involve bison of a given 
rank or higher, as a function of rank. If mating success were independent of rank, the curve 
would follow the 45° line, but instead it deviates markedly from the line and the size of this 
deviation is a measure of the extent to which higher ranked bison are more successful in 
mating. (The area between the 45° line and the curve is called the Gini coefficient, and is 
sometimes quoted as a measure of inequality, although in this case we feel the correlation 
coefficient of rank and mating success is a more easily understood measure of the extent to 
which our rankings predict success.) 



IV. CONCLUSIONS 



In this paper we have introduced a ranking system for division I-A American college 
football based on a common-sense argument frequently made by sports fans (and not limited 
to American football). The method has an elegant mathematical formulation in terms of 
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networks and linear algebra that is related to a well-known centrality measure for networks. 
The method has one free parameter and we have given empirical evidence indicating the 
typical range of best values for this parameter and a method for choosing a value in any 
particular case. 

Applying our method to the seven years during which the BCS ranking scheme has ex- 
isted, we find excellent agreement between the method and the official rankings but with 
some deviations, particularly in well-known controversial cases. We believe that the com- 
bination of sound and believable results with a strong common-sense foundation makes our 
method an attractive ranking scheme for college football. Our method can be applied to 
other ranking problems outside of college football as well, including other games or sports, 
or other problems entirely. We have given one example application to a dominance hierarchy 
in a herd of American bison. 

Finally, we would like to comment on the mathematical generalizability of our method. 
The method lends itself readily to the addition of other elements, such as margin of victory, 
home-field advantage, progress of the season, and so forth: these could be introduced as 
modifiers on the weights of the edges in the network, and it would be interesting to see how 
these affect the outcome of the method. However, we believe that the very simplicity of the 
current method, with its single parameter, is a substantial advantage, and that simplicity 
should be encouraged in these types of mathematical methods. A method such as ours 
reduces the extent to which the calculations must be tuned to give good results while at the 
same time retaining an intuitive foundation and mathematical clarity that makes the results 
persuasive. 
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